ABSTRACT


The objective of this study was to evaluate a benchmarking approach for 
predicting smoke constituent machine yields for conventional cigarette brands 
from worldwide markets. Results for ISO smoke yields thus support the validity 
of a smoke constituent benchmark approach when brands, for which yields are to 
be predicted, have design characteristics within boundaries established by the
exploratory brands. The use of conventional cigarette design parameters,
including tobacco blend, filter construction and ventilation, cigarette paper
porosity, cigarette length and diameter, and processed tobacco components has 
led to a wide array of marketplace brand styles with different smoke yields. 

When cigarette design variants are coupled with regional agriculture and growing 
condition variations, the impact on smoke chemistry can be complex. Of the
brands studied, the majority were “American Blended” cigarettes containing
blends of primarily bright, burley, and oriental tobaccos. In general, the yields of
mainstream smoke constituents determined under standard ISO machine
smoking conditions were well described by weighted least squares regression
relationships with ISO tar (R² > 0.80 and coefficient p-values < 0.05). These
relationships were validated with a subset of brands. Greater than 90% of the 
validation brands’ smoke chemistry was within the 95% prediction intervals. 

Brands with constituent yields beyond the predictive intervals were primarily
those with non-blended tobaccos. Additionally, regional agriculture and growing
conditions may have contributed to smoke metals yield variation, as seen by the
cadmium and mercury yields for two regional brands. The potential impact of
carbon in filters on vapor phase smoke constituent yields was addressed with
multiple regression. Inclusion of a carbon factor variable improved the functional 
relationships between the yields of vapor phase constituents and ISO tar, as 
evidenced by regression statistics and reduced prediction errors. Functional
relationships were developed for nitrogen oxides and tobacco specific
nitrosamine smoke yields by including tobacco filler nitrate or tobacco specific
nitrosamine concentration factors in the respective functional relationships.
Average differences between measured and predicted yields were generally
within the range of one to two measurement standard deviations.


Key Words: mainstream smoke constituents, smoke chemistry, benchmarking
Abbreviations: ISO, International Organization for Standardization; IEC, International
Electrotechnical Commission; IARC, International Agency for Research on Cancer;
CORESTA, Cooperation Center for Scientific Research Relative to Tobacco; ASTM,
American Society for Testing and Materials; NNN, N'-nitrosonornicotine; NAT,
N'-nitrosoanatabine; NAB, N'-nitrosoanabasine; NNK,
4-(N-methyl-N-nitrosamino)-1-(3-pyridyl)-1-butanone

INTRODUCTION

The tobacco industry has provided cigarette smoke tar, nicotine, and
carbon monoxide yield data to various regulatory agencies for many years. More
recently, regulations have been adopted or proposed for reporting the yields of 
specific smoke chemicals found in mainstream or sidestream cigarette smoke 
(Brazil, 2001; World Health Organization, 2000a; and International Digest of 
Health Legislation, 2001). Cigarette smoke is a complex chemical mixture. 

Nearly 4000 chemical constituents have been identified in mainstream cigarette 
smoke (Dube and Green, 1982, and Roberts, 1988). These individual chemical 
compounds and elements, as opposed to tar which is a composite of chemicals, 
are referred to as smoke constituents. Considerable research has been reported
on cigarette smoke constituents identified as potentially toxic (IARC, 1986 and
1999; Reports of the Surgeon General, 1989; Hoffmann, 1993, for example). Smith

et al. (1997, 2000, and 2001) published comprehensive literature reviews of 
IARC Group 1, 2A, and 2B carcinogens found in mainstream cigarette smoke.

Numerous publications are available that review standard smoking machine 
testing methods, alternative smoking methods, and the relevancy of any smoking 
machine testing protocol to smoking and health issues (World Health 
Organization, 2000b; Borgerding, 1997; Baker, 1999; National Cancer Institute,
1996 and 2001; Federal Register, 1997).

The reporting of individual smoke constituent yields in cigarette smoke raises 
a number of issues that remain to be resolved. For example, there are no 
internationally recognized and standardized methods for the generation,
collection, and analysis of mainstream or sidestream smoke constituents. For
smoke generation and collection, cigarette manufacturers in the United States
are required to use the Federal Trade Commission (FTC) smoke method for tar,
nicotine, and carbon monoxide reporting (Federal Register, 1967 and 1980).
Internationally, the International Organization for Standardization (ISO) method is
recognized as the standard for determining cigarette tar, nicotine, and carbon
monoxide yields (ISO 1999, 2000a, b, and c). Both the FTC and ISO standard
methods are executed with smoking parameters of 35-mL puff volumes, 60-s puff 
intervals, and 2-s puff durations. Neither method incorporates other smoke 
constituent analysis guidelines. The Canadian Government has stipulated both
machine smoking conditions and smoke constituent analytical methods in its
Tobacco Act Reporting Regulation for cigarette products marketed in Canada
(Canada, 2000a). This is the only known requirement for specific analytical
methods within a regulation.


In contrast to tar, nicotine and carbon monoxide yield testing, considerable 
differences may exist among testing laboratories in practices for collection and 
analysis of smoke constituents (Purkis, 2001). This lack of standardized test
guidelines can lead to greater variation in accuracy and precision than might 
otherwise be expected. A CORESTA Task Force Special Analytes Group was 
convened in 1999 with the objective of standardizing methodologies for the 
determination of smoke constituents through collaborative testing (CORESTA, 
1999). This is the only known collaborative endeavor in which method
standardization for smoke constituents is being internationally addressed.
Method standardization is a lengthy process requiring multiple laboratory
participation. Comparing trace quantity smoke constituent yields from different
laboratories remains a significant challenge without standard methods.

[...] require nicotine yield reporting based on smoking machine conditions of
45-mL puff volume, 2-s puff duration, and 30-s puff interval, with 60% of filter
ventilation holes blocked. Canadian regulations require reporting smoke
constituent yields using both the standard ISO smoking regimen and a more
intense smoking regimen of 55-mL puff volume, 2-s puff duration, and 30-s puff
interval with total blockage of filter ventilation holes.

Laboratory availability and capacity are other issues to be recognized as the 
need for reporting of smoke constituent yields expands. Worldwide, there is a 
limited number of accredited laboratories with the capability and capacity to 
handle multiple smoke constituent analyses under a variety of smoking 
conditions for a large number of brands. Laboratory capacity may develop to
meet demand if smoke constituent reporting regulations increase.

The objective of this study was to demonstrate the predictability of mainstream smoke
constituent yields. An alternative to measuring all constituent yields for each
brand in a market is to develop functional relationships between
standard cigarette yield parameters and the yields of smoke constituents. In this study,
relationships were developed from one representative sample set of cigarette
brands and then applied to a test series of brands. Benchmarking, as used here,
encompasses the testing of a set of cigarette brand samples having cigarette
design features that are representative of the marketplace. Functional
relationships are then established between the yields of each smoke constituent 
and the predicting variables. The predicting variables are generally the 
fundamental machine yield parameters of mainstream smoke tar, nicotine or 
carbon monoxide determined using standardized smoking methods. These
functional relationships are subsequently used to predict constituent yields for
other brands that were not tested.


This study was not the first to demonstrate the potential utility of a 
benchmarking approach. It was unique, however, in that the study included
brands marketed across a wide range of geographical regions. Benchmarking
offers a technical alternative in providing reliable smoke constituent yield
information. This article reports benchmarking results at standard ISO smoking
machine conditions.

There are numerous published examples of the use of
fundamental yield parameters to predict, compare, or rank other smoke 
chemistry yields. Several studies included machine smoking conditions other 
than FTC or ISO. Young et al. (1981) evaluated the adequacy of tar for 
predicting carbon monoxide, hydrogen cyanide and acrolein. Their conclusion at 
that time was that tar was an inadequate predictor variable for these constituents. 

In a later study, Rickert et al. (1983) compared tar, nicotine, and carbon 
monoxide yields of 26 brands, with and without filter ventilation blocking, at 
several smoking machine conditions. They concluded that the relative ranking of 
cigarette yields at three smoking conditions, with 100% filter ventilation blocking 
at the “intense” condition, was essentially maintained. Tar, nicotine, and carbon
monoxide at two alternative smoking conditions could also be predicted from 
yields at standard smoking conditions. Jenkins et al. (1983) investigated 
mainstream smoke yields of tar, nicotine, carbon monoxide, nitrogen oxides, and 
hydrogen cyanide at FTC smoking conditions for 32 cigarette brands over a wide 
tar range. They found that the smoke yield of the constituents studied could be
estimated within 50% if the brand's tar yield were known. Chepiga et al. (2000)
reported mainstream smoke constituent yields for a 1995 sampling of 29 U.S.
market cigarette brands and two University of Kentucky reference cigarettes.

The analytical work was part of a study to assess smoke chemistry and smoke 
condensate mutagenicity relationships for commercial brands and reference 
cigarettes. For all brands tested at FTC smoking conditions, mainstream smoke 
constituent yields increased with increasing tar. Linear correlations between 
mainstream tar and constituent yields were significant. The authors noted that 
this was particularly relevant because of the wide range of design features of the 
cigarette brands studied. Seeman et al. (2003) compared mainstream tar and 


sample. 

An extensive benchmark study was done cooperatively with the 
Massachusetts Department of Public Health (MDPH) by four U.S. tobacco 
companies in 1999 (Borgerding et al., 2000). Mainstream smoke constituent 
yields of brands representative of the U.S. market were determined at smoking 
conditions defined by the MDPH: puff volume, interval, and duration of 45 mL, 30
s, and 2 s, respectively, with filter ventilation 50% blocked (Massachusetts,
1997). Functional relationships were developed between tar, nicotine, and
carbon monoxide, as measured at either standard FTC or MDPH-defined
smoking conditions, and individual smoke constituent yields determined at
MDPH-defined conditions. These relationships had the potential for predicting
smoke constituent yields for other similarly designed brands.

An indirect example of benchmarking is contained in the Canada Tobacco Act
Reporting Regulation. The regulation allows for use of linear functional 


relationships between tar or nicotine and each of the smoke constituents 
specified in the regulation (Canada, 2000b). Predictive models could then be
used for smoke chemistry yields for either ISO or Health Canada alternative
smoking conditions specified in the regulation. 

An important element of the benchmarking approach is that brands for which 
smoke constituent yields are to be predicted must have design parameters that
are within the range of the design parameters of the brands selected to represent
the market. Design parameter ranges include various tobacco blends, weights,
filter ventilations, and cigarette paper porosities, for example. The study reported
here and the particular functional relationships developed are known to be
applicable solely to current and future brands of Philip Morris USA Inc. and Philip
Morris International Inc. that are within the design range defined by the study
cigarettes. The study approach and the statistical treatment of data are
applicable to other studies of cigarette mainstream smoke constituent
yields.

MATERIALS AND METHODS

Cigarettes 

Philip Morris commercial brands from various international market regions 
were tested. Forty-eight filtered brands were selected to include the majority of
[...] brands were blended cigarettes consisting primarily of bright (“Virginia” or
flue-cured), burley (air-cured), and oriental tobaccos. Three brands contained
primarily bright tobaccos. Nine brands included __ to __ mg of activated carbon
(also referred to as activated charcoal) in their filter construction. ISO tar
yields ranged from approximately 1
to 14 mg per cigarette. Brands were sampled at 14 production facilities in late 
1999. Table 1 is a summary of characteristics for these cigarette brands. 

The study used a split-sample analysis plan. Forty cigarette samples, including 
39 commercial brands and the University of Kentucky 1R4F reference cigarette
(Davis et al., 1984), made up the exploratory sample group. The other 9
commercial brands and the 1R4F reference cigarette were in the validation
sample group. The validation group included a range of tar yields and regional
representation, and was tested after completion of testing of the exploratory 
group. The exploratory group data were used to develop functional relationships. 

The validation group data provided independent assessments of the predicting 
strength of the functional relationships. The 1R4F reference cigarette was
included with each sample group for consistency comparisons.

Mainstream Smoke Collection Methods

The study was performed at ISO smoking machine conditions of 35-mL puff
volume, 2-s puff duration, and 60-s puff interval. Cigarette samples were conditioned
following ISO Standard 3402 (ISO 1999). Twenty replicates were analyzed for
tar, nicotine, and carbon monoxide yields. Seven replicates were analyzed for
mainstream smoke constituent yields.

Mainstream Smoke Constituent and Tobacco Filler Analyses

An ISO/IEC 17025-accredited commercial laboratory conducted the machine 
smoking and constituent analyses. The constituents (Table 2) analyzed for this 
study were representative of those included in several current or proposed 
government regulations (Brazil, 2001; Canada, 2000a; and Massachusetts,
199S). Analytical methods used by the laboratory were the same as those found
in current Health Canada regulations (Canada, 2000b). Tobacco filler from each
cigarette brand was also analyzed for nitrate and tobacco specific nitrosamines
(TSNA): NNN, NAT, NAB, and NNK. Outlier statistics (ASTM, 1994) were applied
to smoke constituent yield and filler chemistry data. Smoke constituents
analyzed, the number of cigarettes per replicate, and the respective Health
Canada mainstream smoke and whole tobacco analysis methods are found in
Table 2.

Statistics

A functional relationship is a mathematical equation that specifies the 
quantitative relationship between a dependent variable and one or more 
independent variables. A statistically significant functional relationship does not
prove cause and effect, but only establishes the degree of association between
the magnitudes of a dependent and independent variable(s). To prove
causality, the investigator must conduct additional studies specifically designed
for the purpose (Kleinbaum et al., 1988). A functional relationship may allow for predicting
values of a dependent variable when values of independent variables are known.
Regression is the computational process used to develop functional relationships
from observed values of the dependent and independent variables. A key factor
in developing functional relationships for smoke constituents was the selection of 
representative cigarette brands for the exploratory sample set. These brands 
and their design features established the boundary conditions for future
prediction of yields for other brands. Valid predictions could be made only for
brands with design characteristics within the boundary of the exploratory set.
The ranges of these design features that determine the boundary conditions are
summarized in Table 1. Extrapolation beyond these ranges is a misuse of the
functional relationships. A second factor in the benchmark approach was
selection of the independent variables from which strong functional relationships
can be developed. The independent variable or variables should be readily
determined through validated methods. Mainstream tar, nicotine, and carbon
monoxide yields at ISO conditions were suitable independent variable
candidates for functional relationships.

Relationships between mainstream smoke tar, nicotine, and carbon monoxide 
and individual smoke constituents were investigated using SAS statistical 
software (Release 8.01, SAS Institute, Cary, NC) and EXCEL 97 (Microsoft,
Redmond, WA). The coefficients of determination, R² and adjusted R², together
with other diagnostic parameters were used to assess the strength and statistical
significance of each functional relationship. These diagnostics included p-values
and F-statistics for regression coefficients, root mean square errors (RMSE), and
predicted residual error sum of squares (PRESS) (Myers, 1990, and Glantz et al.,
1990). Each statistic quantifies a different aspect of the regression relationship
between independent and dependent variables. The larger the value of R² and
the lower the values of RMSE and PRESS, the better a particular relationship
describes the association between independent and dependent variables and,
therefore, the stronger its potential predictive utility. Simple linear, quadratic, and
multiple regression approaches were compared. Regression with transformed
variables was also evaluated for several smoke constituents that were weakly
correlated with either tar, nicotine, or carbon monoxide alone. These
transformed variables incorporated a factor for filler chemistry. As will be seen,
this approach was used primarily for the yields of mainstream smoke nitrogen
oxides and tobacco specific nitrosamines.
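
The diagnostics named above can be illustrated with a short sketch. It is not the SAS procedure used in the study: it assumes an unweighted simple linear regression, takes RMSE as the square root of the residual sum of squares divided by n - 2, and computes PRESS by explicit leave-one-out refitting. The function names and the tar and constituent values are hypothetical.

```python
import math

def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def regression_diagnostics(x, y):
    """Return R2, adjusted R2, RMSE and PRESS for a one-regressor linear fit."""
    n = len(x)
    slope, intercept = fit_line(x, y)
    mean_y = sum(y) / n
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r2 = 1.0 - ss_res / ss_tot
    # Adjusted R2 for one regressor: residual degrees of freedom are n - 2.
    adj_r2 = 1.0 - (1.0 - r2) * (n - 1) / (n - 2)
    rmse = math.sqrt(ss_res / (n - 2))
    # PRESS: refit without brand i, then square the error in predicting brand i.
    press = 0.0
    for i in range(n):
        s, b = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        press += (y[i] - (b + s * x[i])) ** 2
    return {"r2": r2, "adj_r2": adj_r2, "rmse": rmse,
            "press": press, "ss_res": ss_res}

# Hypothetical brand means: tar (mg/cig) and one constituent yield.
tar = [1.0, 2.0, 3.0, 4.0, 5.0]
constituent = [2.1, 3.9, 6.2, 7.8, 10.1]
stats = regression_diagnostics(tar, constituent)
```

Because each PRESS term predicts a brand that was left out of the fit, PRESS is never smaller than the ordinary residual sum of squares, which is why it serves as a check on predictive rather than descriptive strength.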

Functional relationships were assessed for their capacity to predict the smoke 
constituent yields of the validation brand set, as manifested by prediction errors 
(PE). These are the differences between predicted and measured smoke 
constituent yields. The more representative a functional relationship is of the 
measured values, the lower the prediction errors. It is recognized that there are 
no particular criteria for determining the acceptability of a functional relationship. 

It is a judgement based on the evaluation of the statistical parameters and the 
regression plots. 
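
As a minimal sketch of the validation step, the prediction errors and the share of validation brands falling inside their prediction intervals could be tabulated as follows. The sign convention (predicted minus measured), the function names, and the numerical values are assumptions made for illustration; the intervals are taken as already computed.

```python
def prediction_errors(measured, predicted):
    """Prediction error per brand, here taken as predicted minus measured."""
    return [p - m for m, p in zip(measured, predicted)]

def fraction_within(measured, intervals):
    """Share of measured yields inside their (lower, upper) prediction interval."""
    inside = sum(1 for m, (lo, hi) in zip(measured, intervals) if lo <= m <= hi)
    return inside / len(measured)

# Hypothetical yields for three validation brands.
measured = [10.0, 5.0, 8.0]
predicted = [11.0, 4.5, 8.0]
intervals = [(9.0, 13.0), (3.0, 4.9), (7.0, 9.0)]

pe = prediction_errors(measured, predicted)
coverage = fraction_within(measured, intervals)
```

A prediction error can then be set against the measurement standard deviation for the same brand and constituent to judge whether the difference exceeds ordinary analytical variability.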




PM3006878269 


Source: https://www.industrydocuments.ucsf.edu/docs/nxlx0001 













Draft: 4-15-03 Confidential

RESULTS 


Mainstream ISO Smoke Constituent Yields 

Average mainstream smoke tar, nicotine, carbon monoxide and smoke 
constituent yields for each brand in this study and the 1R4F reference cigarette 
are found in Table 3. The results are grouped by exploratory brand and 
validation brand sample sets. Several smoke constituent yields, particularly for 
lower tar yield brands, were below the limit of detection (LOD) or limit of 
quantification (LOQ) established by the contract laboratory. Those constituents 
and respective LODs and LOQs are listed in Table 4.

Functional Relationships 

(1) Regression Approach Evaluation


Functional relationships for each mainstream smoke constituent were
generated by regressing the mean yields of the 39 exploratory brands on the
mean yields of tar, nicotine, or carbon monoxide, as measured at ISO smoking
conditions. For the purpose of functional relationship development, analytical
limits for LOQ and LOD were used instead of omitting low constituent yield
brands from the analyses. Brands with higher yield values would therefore not
impart larger influences, as they would if brands with yields lower than measurement
limits were omitted from the regression analyses. Regression coefficients,
diagnostic statistics, and residuals analyses were examined. There were no
substantive differences in linear and quadratic regression statistics. Therefore,
linear models were selected.

One of the basic assumptions for regression is that residuals are randomly
distributed, with constant variance, over the range of the independent variable
(i.e., homoscedasticity) (Kohler, 1994). However, for most of the smoke
constituents, residuals tended to increase with increasing tar, nicotine, or
carbon monoxide yield. This indicated that many constituents failed to meet the
regression criterion for homogeneous and normally distributed variance.

Several approaches can be used to stabilize the variances. These include, for
example, data transformation or weighted least squares regression (WLS). The
approach eventually used is frequently determined by trial. WLS was selected
for this study. WLS is a modification of standard regression analysis. Once the
weight factor (w_i) that best corrects for the unequal
distribution of variances is determined, the methodology determines the regression
coefficients (a, slope, and b, intercept) by minimizing the weighted sum of the squared
residuals, SS_res,

$$SS_{res} = \sum_i w_i \,(y_i - b - a x_i)^2$$

where w_i = weighting factor (Myers, 1990, Ch 7). Data with large error variances
will contribute less to the SS_res with weighting. Weighting is different from
normalization, in which the constituent yields are divided by a user-defined
constant. In WLS, weighting is not applied directly to the constituent yields but
only to the residuals as shown in the above equation. 
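
The minimization described above has a closed-form solution for a single regressor, sketched below under stated assumptions. This is an illustration only, not the SAS routine used in the study; the function name `wls_fit` and the data are hypothetical, and the symbols a, b, and w follow the text.

```python
def wls_fit(x, y, w):
    """Weighted least squares fit of y = b + a * x.

    Minimizes sum(w_i * (y_i - b - a * x_i) ** 2) and returns
    (a, b, ss_res), where ss_res is the minimized weighted sum.
    """
    sw = sum(w)
    # Weighted means of the regressor and the response.
    xw = sum(wi * xi for wi, xi in zip(w, x)) / sw
    yw = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xw) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xw) * (yi - yw) for wi, xi, yi in zip(w, x, y))
    a = sxy / sxx
    b = yw - a * xw
    ss_res = sum(wi * (yi - b - a * xi) ** 2 for wi, xi, yi in zip(w, x, y))
    return a, b, ss_res

# Hypothetical data lying exactly on y = 1 + 2x, weighted by 1/x.
x = [1.0, 2.0, 3.0, 4.0]
y = [3.0, 5.0, 7.0, 9.0]
a, b, ss_res = wls_fit(x, y, [1.0 / xi for xi in x])
```

The weights enter only through the sums, so the yields themselves are never rescaled, which matches the distinction drawn between weighting and normalization.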

Several weighting functions were evaluated. The inverse of constituent 
variance (i.e., 1/σ²), as frequently used, resulted in predictive intervals that
decreased, but also varied irregularly in width, over the range of tar or carbon
monoxide studied. Those brands with constituent yields below LOD or LOQ
could not be weighted because of the absence of standard deviations and
therefore were eliminated by this 1/σ² WLS approach. An alternative approach
was using the inverse of the independent variable, e.g., 1/tar or 1/CO, as the
weighting factor, since the independent variable was generally proportional to
the constituent variances.
This resulted in randomly distributed residuals, increased R², and reduced
standard errors and PRESS statistics as compared to the unweighted analysis.

The resulting weighted regression slope and intercept, generally less
sensitive to the weighting, were very similar to those from the unweighted
analysis. The improved regression statistics were reflected primarily in
the error variances, confidence intervals, and predictive intervals (Miller et al.,
1993 and Mandel, 1991). WLS generated predictive intervals with non-constant
width that expanded at higher yields and contracted toward the low yield region.
They were more proportional to the constituent yield dispersion over the tar
range. This was a more realistic representation of the data used to develop the
functional relationships in this study. Weighted regression with inverse of 
independent variable weighting was adopted for the development of functional 
relationships. 
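
The widening of the predictive interval with tar can be illustrated by a sketch that assumes the error variance is proportional to the independent variable, which is the model implied by 1/x weights. The formula is the standard textbook prediction interval for weighted regression and is not taken from the study's software. The function name and data are hypothetical, and the critical t value (n - 2 degrees of freedom) must be supplied by the caller from a table.

```python
import math

def wls_prediction_interval(x, y, x0, t_crit):
    """Prediction interval at x0 for a WLS fit with weights w_i = 1 / x_i.

    Assumes var(error_i) = sigma2 * x_i, so every x must be positive.
    Returns (lower, predicted, upper).
    """
    n = len(x)
    w = [1.0 / xi for xi in x]
    sw = sum(w)
    xw = sum(wi * xi for wi, xi in zip(w, x)) / sw
    yw = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xw) ** 2 for wi, xi in zip(w, x))
    a = sum(wi * (xi - xw) * (yi - yw) for wi, xi, yi in zip(w, x, y)) / sxx
    b = yw - a * xw
    # Weighted residual variance with n - 2 degrees of freedom.
    sigma2 = sum(wi * (yi - b - a * xi) ** 2
                 for wi, xi, yi in zip(w, x, y)) / (n - 2)
    # Uncertainty of the fitted line plus variance of a new observation at x0.
    var_line = sigma2 * (1.0 / sw + (x0 - xw) ** 2 / sxx)
    var_new = sigma2 * x0
    half_width = t_crit * math.sqrt(var_line + var_new)
    predicted = b + a * x0
    return predicted - half_width, predicted, predicted + half_width

# Hypothetical tar and constituent means; 3.182 is t(0.975) for 3 degrees of freedom.
tar = [1.0, 2.0, 4.0, 8.0, 12.0]
constituent = [1.1, 1.9, 4.3, 7.6, 12.5]
low_tar = wls_prediction_interval(tar, constituent, 1.0, 3.182)
high_tar = wls_prediction_interval(tar, constituent, 12.0, 3.182)
```

Under these assumptions the interval at 12 mg tar is wider than the interval at 1 mg tar, mirroring the expansion at higher yields described above.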

The regression parameters from WLS are shown in Table 5. Regressions
were not included for nickel, chromium, selenium, and arsenic since their ISO
smoke yields were below the limits of detection or quantification for the majority
of brands.

(2) Correlation Strength

Mainstream ISO tar, nicotine, and carbon monoxide were strongly correlated
with each other and with most mainstream constituents, as indicated by R².
There were exceptions, however. ISO fundamental yield parameters were
weakly correlated with the yields of nitric oxide, nitrogen oxides, and TSNA, with
R² < 0.[illegible]. As will be seen later in this text, tobacco chemistry can be an

important variable to consider for these smoke constituents to strengthen the 
correlation for predicting their yields. Smoke pH was relatively constant among 
brands. This resulted in very weak correlations and a nearly horizontal 
regression line. This same trend of near-constant smoke pH for various brand 
styles was seen in other analyses (Massachusetts, 1999, 2000, 2001).

Mainstream smoke constituents tend to be associated more with either the
particulate phase or the vapor phase. The distinction is not absolute (Hoffmann,
1993). Nicotine, polycyclic aromatic hydrocarbons, and TSNA are examples of
constituents primarily in the particulate phase and, as such, retained by a
standard Cambridge filter pad (Wartman et al., 1959). Lower molecular weight
hydrocarbons such as benzene, 1,3-butadiene, and toluene are primarily in the
vapor phase. Hydrogen cyanide and ammonia, for example, are found in both the
vapor and particulate phases. Part of mainstream smoke formaldehyde is
retained by the Cambridge filter because of its surface property (Dube et al.,
1982).

From the statistics in Table 5, ISO tar is a reasonable parameter choice
for linear correlations of ISO constituent yields. This choice is based on 
regression statistics for constituents associated with either the vapor or 
particulate phase. In general, ISO nicotine and carbon monoxide demonstrated
marginally weaker association, and therefore poorer predictive ability, with vapor
and particulate phase constituents. For this study, ISO tar was selected as the
regressor for functional relationship development and subsequent prediction. As
noted earlier, tar, nicotine, and carbon monoxide had been used to
describe the functional relationships for smoke constituents in a benchmark study
for the Massachusetts DPH (Borgerding et al., 2000). Others involved with
cigarette development, consumer exposure research, or regulatory concerns
may have different analysis needs. Carbon monoxide or nicotine would also be
appropriate regressors for describing the yield relationships for smoke
constituents, as both nicotine and carbon monoxide yields were highly correlated
with tar yields.

(3) Prediction Error 

To test the reliability of the established linear functional relationships, the 
measured constituent yields for the nine validation brands were compared with 
the predicted yields and the 95% predictive intervals. Table 6 shows the 
prediction errors (PE) using ISO tar. The majority (>90%) of constituents in the 
validation brands were within the 95% predictive intervals. Measured brand 
average constituent yields outside the predictive intervals are highlighted in Table
6. No constituents consistently fell outside of the predictive intervals across the
validation brands. Two brands containing only bright tobacco (V2 and V7)
tended to be under-predicted for several constituents. Differences in cigarette
smoke composition between all bright and all burley cigarettes have been
reported (Griest and Guerin, 1977). It was observed that mainstream smoke
yields for phenol, cresols, isoprene, and benzo[a]pyrene were higher for bright
cigarettes than for burley cigarettes. Mainstream smoke from all bright cigarettes 
was reported to be lower in nitrogen oxides and higher in hydrogen cyanide. In 
addition to tobacco blend differences, there are differences in processed tobacco
components in different market regions. Tobaccos from different regions will be
impacted further by agricultural practices and growing conditions. For example,
the concentration of metals in tobacco, and consequently smoke, is affected by
agricultural practices and growing conditions (Stohs et al., 1997, and Smith,
1997). Smoke cadmium yield for brand style (V8) in the study was beyond the
predictive interval. This could be the result of unusual growing conditions for a
particular crop year or years. Although tobacco blending over several crop years
tends to mitigate year-to-year differences, the uncharacteristic change in yield
would not be predicted in a benchmark study. One underlying premise of
benchmarking is that brands are fairly consistent over time. Swauger et al.
(2002) compared smoke constituent yields for over 30 U.S. commercial cigarette
brands for sampling years 1995, 1998, and 2000. Among their findings was that
smoke constituent yields of commercial U.S. cigarettes had remained fairly
constant between 1995 and 2000.
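The validation check described above can be illustrated with a minimal sketch. It assumes a straight-line model of constituent yield on ISO tar, fitted by weighted least squares with weights equal to the inverse of tar yield, and the textbook prediction-interval formula for such a fit. The function names and the default critical value (a normal approximation to the Student t quantile) are illustrative assumptions and are not taken from the study.

```python
from math import sqrt
from statistics import NormalDist


def fit_wls(tar, y):
    """Weighted least squares fit of y = b0 + b1 * tar with weights 1 / tar."""
    w = [1.0 / t for t in tar]
    sw = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, tar)) / sw  # weighted mean of tar
    y_bar = sum(wi * yi for wi, yi in zip(w, y)) / sw    # weighted mean of yield
    sxx = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, tar))
    sxy = sum(wi * (xi - x_bar) * (yi - y_bar) for wi, xi, yi in zip(w, tar, y))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    # Weighted residual variance with n - 2 degrees of freedom.
    s2 = sum(wi * (yi - b0 - b1 * xi) ** 2
             for wi, xi, yi in zip(w, tar, y)) / (len(tar) - 2)
    return {"b0": b0, "b1": b1, "s2": s2, "sw": sw, "x_bar": x_bar, "sxx": sxx}


def prediction_interval(fit, tar_new, crit=None):
    """Return (lower, predicted, upper) for a new brand with tar yield tar_new.

    crit is the two-sided 95% critical value. The default is the normal
    quantile (an assumption); a Student t quantile with n - 2 degrees of
    freedom would be slightly wider.
    """
    if crit is None:
        crit = NormalDist().inv_cdf(0.975)
    y_hat = fit["b0"] + fit["b1"] * tar_new
    # Variance of a new observation (s2 / weight = s2 * tar_new) plus the
    # uncertainty of the fitted line at tar_new.
    var = fit["s2"] * (tar_new
                       + 1.0 / fit["sw"]
                       + (tar_new - fit["x_bar"]) ** 2 / fit["sxx"])
    half_width = crit * sqrt(var)
    return y_hat - half_width, y_hat, y_hat + half_width


def within_interval(fit, tar_new, measured, crit=None):
    """True if a measured validation yield lies inside the prediction interval."""
    lower, _, upper = prediction_interval(fit, tar_new, crit)
    return lower <= measured <= upper
```

A validation brand would be flagged, as in Table 6, when `within_interval` returns `False` for a given constituent.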

The PEs for each constituent in Table 6 are comparable in magnitude to the
standard deviations of measured yields of the validation brands in most
cases. Exploratory brand average yields, best-fit lines, 95% prediction intervals,
and validation group average yields for each smoke constituent are shown in
Figure 1. To exemplify the inherent variability, one standard deviation error bars
are included for several constituents. Standard deviation data are included for all
of the constituents in Table 3.

(4) Brands with Carbon in Filter Design 

Nine brands in the exploratory group contained carbon in their filter
construction. It is well known that carbon, i.e., activated charcoal, has the
potential to selectively adsorb many vapor phase constituents from mainstream
smoke (Baggett and Morie, 1975, and Williamson et al., 1965). Carbon
monoxide and oxides of nitrogen are generally not affected by carbon. Brands
with carbon-containing filters in this study therefore would be expected to
influence the functional regression relationships of vapor phase constituents with
a bias toward lower predicted yields for non-carbon filter brands. Similarly, some
vapor phase constituents would be over-predicted for brands with filters
containing carbon. This influence was reflected in both the quality of the
regression relationships (regression diagnostic statistics) and prediction interval
width. Figure 2 demonstrates the differences between brands with carbon in
filters and those without for benzene yields. The lack of carbon influence on
particulate phase benzo[a]pyrene yield, for example, is also seen in this figure for
the brands in this study.

Functional relationship models for vapor phase constituents were 
consequently expanded to include a factor for the presence of carbon in filters. 

Multiple regression relationships were developed using tar and an “indicator” 
variable of 1 or 0 for the presence or absence of carbon, respectively, for the 39 
exploratory brands. Again, regressions were weighted with the inverse of tar 
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yield. The results showed general reductions in root mean square. errors, and 
increases in R 2 for most of the vapor phase constituents. Diagnostic statistics for 
wjpjghted regression v with and without the 'indicator' variable^ are compared jn _ 
Table 7. Use of an “indicator” variable for filters with carbon does not take into 
account different filter construction, different weights of carbon in carbon filters, 
or the surface areas and activities of carbons used in filters for various brands. 
Multiple regression adds a level of complexity to both . relationship development 
and to application of those relationships for predicting yields of other brands. 
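The expanded model can be sketched as a weighted multiple regression of yield on tar and a 0/1 carbon indicator, with weights equal to the inverse of tar yield. The sketch below solves the weighted normal equations directly. The function names and any data passed to them are hypothetical, and the study's own software and diagnostics are not reproduced here.

```python
def solve_linear(a, b):
    """Solve the square system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def fit_tar_carbon(tar, carbon, y):
    """Fit y = b0 + b1 * tar + b2 * carbon with weights 1 / tar.

    carbon is 1 for brands with carbon in the filter and 0 otherwise.
    Returns [b0, b1, b2].
    """
    rows = [(1.0, t, float(c)) for t, c in zip(tar, carbon)]
    w = [1.0 / t for t in tar]
    # Weighted normal equations: (X' W X) b = X' W y
    xtwx = [[sum(wi * r[i] * r[j] for wi, r in zip(w, rows)) for j in range(3)]
            for i in range(3)]
    xtwy = [sum(wi * r[i] * yi for wi, r, yi in zip(w, rows, y)) for i in range(3)]
    return solve_linear(xtwx, xtwy)


def predict(coef, tar, carbon):
    """Predicted yield for a brand with the given tar yield and carbon flag."""
    return coef[0] + coef[1] * tar + coef[2] * carbon
```

With a negative carbon coefficient, the predictions fall on two parallel lines, one for brands with carbon and a higher one for brands without.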

The potential impact of carbon filters should be considered in assessing brand 
constituent yields and predicting relationships for vapor phase constituents. The 

benchmarking approach would be equally valid if brands with carbon in filters
were modeled separately from those without carbon in future studies. Figure 3 is
a comparison between measured smoke yields and linear and multiple
regression predicted yields for benzene and acrolein, as examples. With simple
linear regression, there is one predicted yield line. Predicted yields from multiple
regression with tar and a carbon filter factor are not on a single line. The lower
predicted yields are for brands with carbon, as the coefficient for the carbon
indicator variable is always negative. It is seen that there is less over-prediction
(predicted yield higher than observed) for yields below the linear model line and less
under-prediction (predicted yield less than observed) of yields above the linear
model line with multiple regression. The grouping for predicted yields for brands
without carbon is higher than predicted yields using linear regression with carbon
only. In general, multiple regression with a carbon factor resulted in lower
prediction errors for vapor phase constituents of most brands.

(5) Mainstream TSNA and Nitrogen Oxides Treatment

Many of the chemical constituents found in smoke are generated during 
combustion from precursors found in the leaf. For example, phenolic compounds 
are derived partially from cellulosic materials in leaf, some nitrogen compounds 
are derived partially from proteins in leaf, nitrogen oxides are derived partially 
from nitrates in leaf, and carbon oxides come from many leaf precursors through 

combustion (Baker, 1999). The amounts of these constituents found in
mainstream smoke depend in part on the amounts of their precursors in the
tobacco that is consumed during puffing and the conversion efficiencies of the
precursors. Other smoke constituents are distilled intact from the tobacco to
smoke during the smoking process. Mainstream smoke constituents such as
nicotine (Gorrod et al., 1999) and, to some extent, TSNA are in this category
(Fisher et al., 1990). The amounts of these constituents found in mainstream
smoke depend on their concentrations in the tobacco that is consumed during
puffing and their transfer efficiencies to smoke.

As noted earlier, two classes of mainstream smoke constituents, the nitrogen 
oxides and TSNA, were weakly correlated with ISO tar, nicotine, or carbon
monoxide. Therefore, tobacco filler nitrate and TSNA concentrations were used
to modify functional relationships for predicting the yields of nitrogen oxides and
TSNA in mainstream smoke. Filler concentrations of nitrate and TSNA for each 
brand style in this study are found in Table 8. 
For TSNA, multiple regression with ISO tar and filler TSNA concentration was
first assessed. The TSNA cigarette concentration was calculated from the dry-weight
basis (filler less water content) cigarette tobacco weight and the dry-weight
basis concentration of the component in filler. Multiple regression
relationships for NNN, NNK, NAT, and NAB were significantly improved over
simple linear regression models as evidenced by higher respective R² values, lower
root mean square errors, and lower prediction errors for both the exploratory and
validation brands. Examination of multiple regression residuals revealed that
brands lower in both tar yield and filler TSNA concentration tended to be
under-predicted. Brands with either higher tar yields and lower filler TSNA


concentration or brands with lower tar yields and higher filler TSNA concentration 
was considerably less significant than the coefficient for the interactive variable.
The final relationship used for prediction was a simple linear regression of smoke
TSNA yields with the transformed, interactive variable √(tar × cigarette filler
TSNA). Residuals were homogeneously and normally distributed over the
transformed variable range. The result was a significant increase in regression
intervals and simpler visualization. NAB was an exception, however.
Differences between multiple and simple regressions with transformed variables
were minimal. Of the four TSNA analyzed, prediction errors were relatively
greater for NAB. Figure 4 contains transformed variable regression models and
prediction intervals for mainstream smoke NNN, NNK, NAT, and NAB.

A similar approach was used for regression model development for nitric oxide
and nitrogen oxides yields. Prediction model quality was similar for multiple regression
using tar and cigarette nitrate concentration and for simple linear regression using a
transformed interactive term variable, √(tar yield × cigarette nitrate concentration).
Prediction models for nitric oxide and nitrogen oxides are found in
Figure 5. Prediction errors were reduced relative to simple linear regression
using only tar as the independent variable. The use of prediction models beyond
simple linear regression for TSNA or nitrogen oxides would have less impact if all
brands were more similar in filler concentration of nitrate and TSNA. The fact
that this study included brands from a wide range of regional markets, and
consequently a wide range of concentrations of these tobacco components,
warranted different regression model approaches.
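The transformed-variable models can be sketched in three steps: compute the per-cigarette content of the filler component on a dry-weight basis, form the interactive term √(tar × content), and regress smoke yield on that single term. The sketch below uses an ordinary (unweighted) least squares line because the weighting used for these particular models is not stated. The function names and units are assumptions for illustration.

```python
from math import sqrt


def cigarette_content(filler_dry_weight_g, conc_per_g_dry):
    """Per-cigarette content of a filler component (dry-weight basis).

    filler_dry_weight_g: cigarette tobacco weight less water content, in grams.
    conc_per_g_dry: component concentration per gram of dry filler.
    """
    return filler_dry_weight_g * conc_per_g_dry


def interactive_term(tar_yield, content):
    """Transformed interactive variable sqrt(tar yield x cigarette content)."""
    return sqrt(tar_yield * content)


def fit_line(x, y):
    """Ordinary least squares fit of y = b0 + b1 * x; returns (b0, b1)."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    return y_bar - b1 * x_bar, b1


def predict_yield(coef, tar_yield, filler_dry_weight_g, conc_per_g_dry):
    """Predicted smoke yield from tar yield and filler composition."""
    content = cigarette_content(filler_dry_weight_g, conc_per_g_dry)
    return coef[0] + coef[1] * interactive_term(tar_yield, content)
```

The same functions apply to nitrogen oxides by passing filler nitrate concentration in place of filler TSNA concentration.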

(6) Relative Prediction Errors

To capture the overall impact of the different regression models on prediction 
errors for the validation brands, an average absolute relative prediction error 
(average ARPE) was calculated for each constituent. This was simply the mean of the
absolute relative prediction errors over the nine
validation brands. The relative prediction error for a constituent yield in a
particular validation brand is the difference between measured and predicted yields
divided by the measured yield. The average ARPE was then compared to the
coefficient of variation, %CV, for each constituent averaged over the nine
validation brands. These averaged values are found in Table 9. For vapor and
particulate phase constituents, prediction errors were generally within two
standard deviations of the measured yield. Average ARPEs were therefore
approximately two times the respective average %CV. It should be recognized,
with smoke constituents measured at the microgram and nanogram levels, that
very small differences in predicted and measured yields will result in large
ARPEs. For example, the average ARPEs for the nine brands for phenol and
pyridine were 22% and 21%, respectively. Those percentages would equate to
average prediction errors of 2.1 micrograms and 1.8 micrograms. An ARPE of
21% for NNK yields would equate to an average prediction error of 10
nanograms. If the average prediction error for 4-aminobiphenyl were 1.0
nanograms, the ARPE would be 50%.
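The average ARPE defined above is simple to compute. The following minimal sketch uses hypothetical function names and expresses the result as a percentage.

```python
def relative_prediction_error(measured, predicted):
    """(measured - predicted) / measured for one constituent in one brand."""
    return (measured - predicted) / measured


def average_arpe(measured, predicted):
    """Mean absolute relative prediction error over brands, in percent."""
    errors = [abs(relative_prediction_error(m, p))
              for m, p in zip(measured, predicted)]
    return 100.0 * sum(errors) / len(errors)
```

For instance, a prediction error of 1.0 nanogram on a measured yield of 2.0 nanograms gives `average_arpe([2.0], [1.0])` equal to 50, which shows how small absolute errors translate into large relative errors at low yields.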
DISCUSSION


A benchmarking approach for obtaining reliable cigarette smoke constituent 
machine yield information was evaluated with a worldwide sample of commercial 
brands. The approach was applied to smoke yield data obtained at standard ISO 
smoking conditions. Results for the diverse range of brands supported the 
underlying assumption that smoke constituent precursors in tobacco were 
available somewhat uniformly across tobacco blends from the different regions. 
Precursors include, for example, tobacco proteins, cellulose and lignin materials, 
and nitrogen-containing compounds. Mainstream smoke constituent yields 
increased with smoke tar, nicotine, or carbon monoxide yields. Smoke tar is 
directly related to tobacco consumed during puffing, which in turn is dependent 
on cigarette design. 

Benchmarking is based on two key elements. Cigarettes for which smoke 
constituent yield predictions are to be made must have design and tobacco blend 
characteristics within the boundary established by the benchmarking set of test 
brands. This particular study was for Philip Morris brands manufactured in
various regions. Extrapolation to other brands for which brand design, quality,
and consistency are not known could be misuse of the regression relationships.
Extrapolation beyond the design range could lead to predicted yields with errors
of unknown magnitude. The second key element for benchmarking is the
single-laboratory study design. Since there are currently no universally recognized
smoke constituent analysis methods, it was important to use the same
laboratory and methods for a particular constituent analysis. The single-laboratory
approach, in which quality-controlled and validated methods are used,
is a viable option for maintaining data consistency. However, the accuracy of the
test methods in a single-laboratory approach cannot be determined in the
absence of standardized methods.

The benchmark approach utility is determined by how closely predicted yields 
compare to measured yields. Prediction errors for smoke constituents of the 
validation brands were moderate in this study when examined relative to other 
sources of potential variation. One source is the point-in-time variation in a 
single set of sample replicates, as evidenced by the magnitude of the analysis 
standard deviation. Additional variation is inherent in more subtle day-to-day 
fluctuations in instrument calibrations, technician performance, and smoking 
machine operations. Testing laboratories practicing sound quality control 
procedures will minimize, but not eliminate, these analysis variations. Variability 
in method execution over time was estimated by comparing average constituent 
yields for the 1R4F Kentucky reference cigarette reported by the contract lab for 
several company studies (Table 10). Analyses were performed over an 11- 
month period with the laboratory using the same methods throughout. 
Consistency of the reference cigarette was assumed. Average 95% confidence 
intervals for standard smoke parameters (tar, nicotine, puff count, etc.) were 
approximately +/- 7% of parameter means. For mainstream smoke constituents, 
the average 95% confidence intervals for the mean yields were considerably 
wider. Vapor phase and particulate phase constituent average 95% confidence
intervals were +/- 18% and +/- 22% of mean yields, respectively. These confidence
intervals were of similar magnitude to most of the prediction errors in the current
benchmark study.

A second source of variation in smoke constituent yields is the cigarette brand 
itself. Three potential sources of cigarette variability are described in Annex C of 
ISO Standard 8243 (ISO, 1991). Short-term variability is attributed to minor 
fluctuations in tobacco and components around targets during manufacture. 
Medium-term variability reflects short-term variability and also batch-to-batch 
changes in components, tobacco grades, and machinery performance. Long-term
variability can result from crop year differences or machinery replacement,
for example, and is additive with short- and medium-term variability. As
previously noted, natural products such as tobacco are subject to crop year
weather variations, regional agricultural practices, and tobacco processing
differences. Tobacco blending from different crop years minimizes these
differences, but some medium- and long-term variation is expected. For this
reason, a smoke constituent yield measured for a brand produced today could
vary from the yield for the same brand produced at another time. An example of
typical smoke constituent yield variation for a single brand produced over a
period of time is seen in Table 11. The brand was tested monthly over a nine- 
month period by an internal research laboratory. The measured yield variation 
encompasses short- and medium-term brand variation and laboratory method 
variation over time. A significant point of the information in Table 11 is the 
magnitude of the 95% confidence intervals around the mean constituent yields. 

The average 95% confidence interval for all smoke constituents analyzed was +/-
26% of the means for this time period. These intervals are only moderately
greater than the confidence intervals around the means for the 1R4F reference 
cigarette tested by the contract laboratory over time. The intervals are also 
similar in magnitude to the average relative prediction errors for the validation 
brands in this current benchmark study. These two examples of smoke 
constituent yield variations support the premise that the predicted yields and 
prediction intervals, generated from an appropriately designed benchmark study, 
are valid options for providing smoke constituent yield information for 
conventional cigarette brands represented in that study design. The 95% 
prediction intervals generated from the regression functions provide reasonable 
expectations of the variation for brands at some average tar, nicotine, or carbon 
monoxide yield level. 

When benchmark studies are conducted in a multiple-laboratory setting, the impact of
the lack of standardized methods is immediately clear. Purkis et al. (2001)
compared ISO smoke constituent yields for three brands tested by six
laboratories. Each laboratory used its own internally validated methods. All
laboratories reported the same relative ranking of constituent yields for the
brands, but there was significant variation between laboratories for particular
constituent yields. Ratios of highest to lowest reported yields ranged from 1.09
(tar) to 3.20 (quinoline). The authors postulated that, on average, differences
greater than 60% between laboratories would be required for confidence that
observed yield differences were statistically significant. In contrast,
within-laboratory variation for constituent yields was significantly lower, with average
coefficients of variation of 9%. Data reported in Table 12 reinforce the
conclusions of Purkis et al. Smoke constituent yields for the 1R4F reference cigarette
reported by two contract laboratories (point-in-time measurement) and from
several literature sources were compared. The ratios of maximum to minimum
reported constituent yields ranged from moderate (approximately 1.1) to
significant (greater than 2.0). These examples illustrate the need for caution in
comparing smoke constituent yield data from different testing sources.

A benchmark approach can provide predicted smoke constituent yield
information as demonstrated by this study. A limitation of benchmarking is the
inability to reliably predict constituent yields for brands with some unusual
characteristic not anticipated from the design range of the model brands. This
limitation would be minimized by successively adding new brands to the
benchmarking model brands over time and including a validation set of brands as
an internal check of the functional relationships. Benchmarking is useful only to
the extent that the test brands are representative of brands for which yield
predictions are to be made. Prediction errors cannot be eliminated. The
magnitudes, however, should be in the range of expected method and brand
variation provided the basic elements of a benchmarking approach are
recognized. These basic elements include cigarette brands with design
features representative of the marketplace; predicted brands with design
features within the ranges defined; brand quality consistency over time; and a
single testing laboratory with consistent method performance.
Several regression techniques were used to address non-constant variance. The first,
square-root transformations of dependent variables (smoke constituent yields) or the
independent variable (tar, nicotine, or carbon monoxide), had marginal effect on residuals
distributions.

Another approach to addressing residuals was 
In order to correct for the deviation from the basic assumptions, the approach o
It is used when a regression model is to be fitted to a data set with non-constant error
variance. 